DIGITAL PROCESSING OF ACOUSTIC SIGNALS 
WITH APPLICATION TO AN 
ASW SIGNAL PROCESSOR 



Vance Erick Adler 




Naval Postgraduate Scnool 
Monterey, California 93940 




Monterey, California 




DIGITAL PROCESSING OF ACOUSTIC SIGNALS 
WITH APPLICATION TO AN 
ASW SIGNAL PROCESSOR 
by 

Vance Erick Adler 

Thesis Advisor: G. A. Rahe 

December 1973 



Approved loK public releas c; distribution unlimited. 



Digital Processing of Acoustic Signals 
With Application to an 
ASW Signal Processor 



by 



Vance Erick Adler 
Lieutenant, United States Navy 
B.S., San Jose State College, 1967 



Submitted in partial fulfillment of the 
requirements for the degree of 



MASTER OF SCIENCE IN AERONAUTICAL ENGINEERING 



from the 

NAVAL POSTGRADUATE SCHOOL 
December 1973 



Library 

'Wai Postgraduate School 
Monterey, California 93940 



ABSTRACT 

There is a growing need within the Navy for methods of detecting 
discrete narrowband signals in a non-stationary background. This paper 
concerns itself with the application of digital processing and spectral 
analysis techniques toward that goal. The use of the fast Fourier 
Transform in estimating the power spectrum of a signal is described. 

The method involves sectioning the time record, making "raw" estimates 
of the spectrum from these sections, and averaging these "raw 11 estimates. 
It is shown that more stable estimates are available if the segments are 
overlapped and an optimum amount of overlap for the case of the Hanning 
Window is found. It is shown that the stability of these spectral esti- 
mates can be interpreted as processing gain in the case of a discrete 
narrowband signal in additive noise. And finally, a brief description of 
signal detection theory applied to a human observer is presented to empha- 
size the flexibility that a human operator can bring to a signal detection 
system. 
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I. INTRODUCTION 



Today* s ASW signal processing systems continue to grow in complexity 
and while many functions will be automated in systems now under develop- 
ment, the human operator will continue to play a key role in the proces- 
sing of data and the making of decisions. Man provides a flexibility to 
the system that is not available through complete automation and it is, 
therefore, imperative that he be considered an integral part of the 
system. 

In the past, assumptions have been made regarding the requirements 
that could be imposed upon the operator, without any precise knowledge 
of how an operator f s performance actually would be influenced by the sus- 
tained demands of a system. In general, it apparently has been assumed 
by system designers that the operator was like any of the hardware com- 
ponents that go into a system, i.e. that he would continue to perform at 
a level throughout a mission approximately equal to that at which he 
began. Common sense does not support this assumption, but to assume 
otherwise introduced difficult problems for which no useful answers 
were available. It seemed wise, or at least expedient, to let the 
characteristics of the electronics determine the direction of design and 
the development of procedures for the use of the system, with only casual 
consideration of the characteristics of the operator. However, studies 
of the role of the operator indicate that the rate of degradation of 

operator efficiency, and therefore system effectiveness, is a function 
of time. And it is much more rapid than even the most critical "user 1 ’ 
or the most knowledgeable designer would have thought reasonable when 
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the systems were much less complicated, and demands on the operator were 
much less drastic than at present. 

The quality of the operator's decision is based, to a great extent, 
on his insight and understanding of the problem. Thus, the primary objec- 
tive of a system which utilizes a human observer is to provide the oper- 
ator with the necessary insight and understanding of the problem to enable 
him to make the best decision possible. To do this, the display should 
utilize natural qualities of human perception so as not to drain the 
operator psychologically or expect him to make dicisions based on infor- 
mation he is not able to fully perceive. 

The quality of the operator's decision is also based on the accuracy 
with which the information provided him actually represents the physical 
situation. The purpose of the passive A SW processor is to detect the pre- 
sence of certain narrowband signals with random phase variation from a 
background of random ocean noise. It has been shown that the optimum 
receiver, in a maximum likelihood sense, for detection of such signals 
is an infinite set of Fourier filters followed by incoherent integration 
integration of power and a decision element (Williams, 1972). The pre- 
sent level of technology indicates that digital processors offer the high 
resolution, high dynamic range, repeatability, and flexibility necessary 
to effectively produce such a receiver. 

The purpose of this paper is to discuss the digital processing of 
acoustic signals with application to a new generation of passive ASW sign- 
al processors being developed today. Specific mention will be made of 
techniques in the "Spotlight 11 processor under development at the Naval 
Postgraduate School by Professor G.A. Rahe. 

General digital processing considerations are discussed. The statis- 
tical properties of the spectral estimate are presented as well as methods 
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for improving them. The processing gain available from a system, which 
utilizes the FFT as a filtering device and incoherently integrates the 
power, is derived and finally a brief description of detection theory 
applied to a human observer in a sequential observation task is discussed. 
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II. DATA ANALYSIS 



A. NOISE REPRESENTATION 

The ambient acoustic wave fields in the ocean are often quite ener- 
getic and contain a wide range of frequencies. The study of these noise 
fields forms an important and necessary prelude to any serious design of 
detection systems. In the ocean, for example, one finds biological 
noise sources, noises due to wave action at the surface of the water, 
noises produced by meteorological conditions such as precipitation, and 
a great deal of shipping noise. For passive detection systems, these 
noise fields furnish the background out of which signals must be detected. 
From the experimental work of Parkins (1968) it seems reasonable to as- 
sume that the ocean noise field may, at least a large percentage of the 
time, be modeled as a weakly stationary, Gaussian random process. 

In order to understand the theoretical concepts of signal processing 
it is necessary to characterize the ambient noise field (and the desired 
signals) in commonly understood mathematical terms. The frequency domain 
(spectral) representation in which the "signal" is decomposed into a set 
of sinusoidal components as specific frequencies will be considered here. 
Select a particular sample function of noise and select from that 

sample an interval of duration L extending, say, from t = -L/2 to t = L/2. 

( s ) 

Such noise sample n (t) is shown in figure 2.1(a). Now generate a 
periodic waveform in which the waveform in the selected interval is re- 
peated every L seconds as in figure 2.1(b). This periodic signal 
( s ) 

n (t) with period L can be represented as a sum of harmonically re- 
L 

lated sinusoids (Fourier series) as follows: 
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where is a DC level; and <f>^ are, respectively, the amplitude and 

phase of each sinusoid (line); and f (the fundamental frequency) is 1/L. 
Plots of A^ and <f>^ as functions of frequency are referred to as amplitude 
and phase spectra, respectively. A^ and are given by 



and 
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(a k and t> k are Fourier coefficients). The DC level a^ is obtained by 



(g ) 

calculating the time average of n (t): 

Li 



a = A = — 
o o L 



L/2 

n L (s) (t)dt 



( 2 . 6 ) 



-L/2 



Several examples of amplitude and phase spectra of periodic signals are 
shown in figure 2.2. 

Although not all periodic signals can be represented by Fourier har- 
monic series, the class which can be is large. The limitations of the 
representation are discussed in Thomas and in Jenkins and Watts. Several 
other forms of Fourier series are also discussed in the literature. The 
form of eqn. 2.1 is a modified trigonometric version and is useful from 
an intuitive standpoint. Another form, which is more convenient for 
mathematical manipulations, is the exponential or complex version in 
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Figure 2.1. (a) A sanple noise waveform, (b) A periodic 
waveform is generated by repeating the interval in (a) 
from -L/2 to L/2. 
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Figure 2.2. Amplitude and phase spectra of some periodic 
signals. 
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( s ) 

which the signal n (t) is given by 



n 



L 




(2.7) 



where 



L/2 




9 • * * 



( 2 . 8 ) 
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The coefficients C are complex and have the property that 




(2.9) 



(i.e. and are complex conjugates). These coefficients are related 

to the coefficients in equation 2.1 by 



The exponential spectrum of the Fourier series produces a two-sided 
spectrum, each of half amplitude, one at frequency f and one at frequency 
-f. Although equation 2.7 indicates that the representation includes 
negative frequencies, this is not of physical significance because, in 
terms of this representation, the sum of the components at the positive 
and negative frequencies produces a real (in a mathematical sense) 
quantity. 

Frequency domain representations of non-periodic signals require a 
generalization of the Fourier series approach. One might intuitively 
reason that L approaches infinity for a nonperiodic signal, which implies 
that the fundamental frequency (1/L) approaches zero. If this reasoning 
is carried out mathematically, it leads to a representation of the sig- 
nal as a continuum of frequencies rather than a "line* 1 spectrum. The 
continuous representation (Fourier integral or Fourier transform) is 



C = A = a 



o o o 



( 2 . 10 ) 




( 2 . 11 ) 
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given by 



00 



n(t) = f N(f)e :j27rft df 



( 2 . 11 ) 



wne re 



00 



N(f ) = f n(t)e j2lrft dt 



( 2 . 12 ) 



— 00 



In general, N(f) is a complex quantity that can be represented by its 
amplitude and phase components as follows; 



Plots of N(f) and (}>(f) as functions of frequency are called ampli- 
tude and phase spectra, respectively, and are analogous to the discrete 
and cf^. Several examples of the amplitude and phase spectra of non- 
periodic signals are shown in figure 2.3. Note the similarity between 
the continuous spectra and the envelope of the line spectra in figure 
2 . 2 . 

It should be noted, however, that not all time functions possess a 
Fourier transform. The condition for the existence of the Fourier trans- 
form is that the Fourier integral exist in the limit as T-* )0 . Existence 
is guaranteed by absolute convergence of the integral of equation 2.12, 



A number of functions having great usefulness do not meet this re- 
quirement; functions such as sine waves or step functions. These func- 
tions are nevertheless being handled by allowing the Fourier transform 
to contain impulses. In practice, the class of signals which are Fourier 
transformable is sufficiently large to encompass a wide variety of ap- 
plications . 




(2.12a) 



00 



|n(t) | 2 dt < 00 



(2.12b) 
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Figure 2.3. Amplitude and phase spectra of some nonperiodic 
signals. 
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Fourier series and Fourier transforms of signals differ in their 
theoretical solutions attributable primarily to the fact that, for 
practical purposes, only finite length signals can actually be sampled 
and computed. The results of this computation are used only to provide 
estimates of the theoretical solutions. As seen in equation 2.7, the 
series method requires the sum of an infinite number of discrete sinu- 
soids and the integral method requires an infinitesimal frequency re- 
solution. Therefore, only approximations to these representations are 
obtained in practice. The discrete Fourier Transform (DFT) , which will 
be discussed later, approximates a Fourier series by arbitrarily forcing 
the signal to be periodic and then truncating it. to a finite number of 
terms . 

B. DIGITAL SIGNAL PROCESSING 

During the decade of 1960-1970, an important new approach to wave- 
form manipulation, or signal processing, came into importance. It be- 
came practical to represent information-bearing waveforms digitally, that 
is, symbolically, and to do signal processing on the digital representa- 
tion of the waveform. Thus all the flexibility of digital computers 
became available for processing of waveforms, and this has had an effect 
on all fields in which waveform processing is important. It became ap- 
parent that one could manipulate waveforms digitally in ways that would 
be totally impractical with continuous representations. One advantage 
in this respect is the freedom from time constraints when a waveform is 
in a digital memory. Once a waveform is in a computer memory, it may 
be considered to have any desired time base associated with it. As a 
result of the Fast Fourier Transform (FFT) algorithm, it became practical 
to do signal processing on waveforms in either the time or the frequency 
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domain, something never practical with continuous systems. The Fourier 
transform became not just a theoretical discription, but a tool; and 
real-time digital spectral analysis became a reality. Some of the 
practical considerations of performing this processing will be discussed 
in this section. 

1. Sampling Considerations 

Digital signal processing is based on the principle that an ana- 
log signal can be represented by the values of the signal at discrete 
points in time. This principle is embodied in the sampling theorem, 
which states that a band-limited signal is completely characterized by 

a set of samples taken at a minimum rate of 2f (called the Nyquist 
r max 

rate), where f ^ represents the highest frequency component in the 
signal. Although this statement of the sampling theorem appears straight- 
forward, there are several practical considerations which should be dis- 
cussed. 

One difficulty in applying the sampling theorem is that actual 
signals do not occupy well defined frequency bands, i.e., the spectra 
of real signals tend to gradually roll off until the amplitude of the 
spectral components reaches a minimum detectable level. Therefore, most 
sampling systems have a filter just before the quantizing device to 
limit the signal to a predetermined band. This filter is called an anti- 
aliasing filter. 

When analog data are quantized i.e., converted from a continuous 
waveform to discrete values, at a rate of f g samples per second, the ana- 
log spectrum is divided into frequency bands of width f /2 (figure 2.4a). 
During sampling, the energy in each band is folded or aliased into the 

lowest band (0 to f /2) and f /2 is called the folding frequency. The 

s s 
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folding operation can be envisioned if the bands in figure 2.4a are 
viewed from "above", as in figure 2.4b. The original analog spectrum is 
collapsed into the lowest frequency band, and the even-numbered bands are 
inverted in frequency as they are folded. Therefore, the net result of 
the sampling process is that, in spectra computed from the digital sam- 
ples, all energy in the original analog spectrum appears in the band 
0 to f^/2 as seen in figure 2.4c. Figure 2.5a shows an example of a 
typical aliasing situation. It is obvious that the spectrum computed 
from the digital samples (figure 2.5b) provides very little information 
concerning the actual frequency content of the original signal because 
the original location of the computed spectral components is unknown. 

Only by containing the original spectrum in a single known band can 
the frequency content be unambiguously determined. 

Typically we think of two types of antialiasing filters, the 
low-pass filter and the band-pass filter. Both have application in 
digital signal processing and will be briefly discussed here. 

The low-pass anti-aliasing filter will be assumed to have a 
frequency response function as shown in figure 2.6. The amplitude level 
of the aliased components is given by 



A a = \ 



log 10^ f s“ f h^ Iog 10 f h 

log io 2 



" 3 * 32 “h 1Og 10 (f s /f h ' 



db 



(2.13) 

(2.14) 



where f g is the sampling frequency, f^ is the filter cutoff frequency, and 
is the filter rolloff slope (db/octave). Equation 2.14 applies when 
the original analog signal spectrum is assumed to have components at 
frequencies that are above the folding frequency and equal in amplitude 
to components in the pass band. 
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Band 1 
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Band 3 


Band 4 














0 V 2 f = 3 V 2 2f s 



(a) Effect of sampling the original analog spectrum 




(b) Folding operation viewed from "above". 



0 f s /2 

(c) Bandwidth of tho spectrum computed from the digital 
samples. 



Figure 2.4. Aliasing effect of sampling on the Fourier 
spoctrum. 
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(a.) Original analog spectrum 
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V 2 f = 

(b) Spectrum computed iron digital samples 



Figure 2.5. Typical antialiasing situation 



filter response 




f = sampling frequency 

3 

f^= filter cutoff frequency (-3 db) 
1.1^= filter rolloff slope (db/octave ) 
A & = amplitude of aliased component 

Figure 2.6. Low-pass antialiasing filter 
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High-quality variable-cutoff commercially available filters 
generally have rolloff slopes from -12 to -48 db/octave. Sample fre- 
quencies usually are approximately three to four times the filter cut- 
off frequency and in order to provide a margin of safety from aliasing 
effects, an anti-aliasing filter slope of at least -36 db/octave and 
preferrably -48 db/octave are desirable. These figures apply to situa- 
tions in which a variety of general-purpose signal processing techniques 
will be applicable. Less conservative parameters can be substituted as 
the situation dictates. 

A bandpass signal of bandwidth W and center frequency f can be 
sampled at a rate greater than or equal to twice the bandwidth rather 
than twice the highest frequency component, if appropriate antialiasing 
filtering is performed. The bandpass antialiasing filter will be assumed 
to have the frequency response function shown in figure 2.8. Once the 
sample frequency has been chosen, the level of the components aliased 
from above the band of interest (assuming they are equal in amplitude 
to the passband components) is given by 

A sh ■ 3 ' 32 «h loe 10 (K V f h - » db < 2 ' 15 > 

where f^ is the upper filter cutoff frequency, is the filter slope 
(db/octave), f is the sample frequency, and K is the band number (see 
figure 2.7). The level of the components aliased from below the band 
of interest is given by 

A sSi = -3.32 M a log 10 [(K-l)f s /f £ - 1] db (2.16) 

where f and M are the lower filter cutoff and slope, respectively. 

At At 

The procedure for choosing the sample frequency in bandpass sit- 
uations is not as straightforward as in the low-pass case. However, a 
fairly simple iterative procedure is as follows. 
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Figure 2.7. Sampling of a bandpass signal 




Figure 2.8. Bandpass antialiasing filter 
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1. Define an effective signal bandwidth W 1 that is about two 



to four times the actual bandwidth of interest, depending on the steepness 
of the filter rolloff. 

2. Select a sampling band by choosing K to be the largest in- 
teger such that K < f^/W 1 + 1/2. This will guarantee that the signal 
band W 1 will be encompassed by a sampling band. 

3. Define the sample frequency to be 

2f 

f s = K-l/2 (2.17) 

4. Calculate values for A', and A A . If these values are not 

sh s£ 

acceptable, repeat steps 1 through 4 using a larger effective bandwidth W T . 

5. If it is inconvenient to generate the exact sample frequency 

calculated in eq. 2.17, it may be adjusted as needed. Recalculation of 

A and A will indicate if a significant amount of aliasing will result 
s n s 

from the adjustment. 

The effect of sampling properly filtered bandpass signals at a 

rate proportional to the bandwidth is the bandshift the signal spectrum 

to the baseband (figure 2.4c). This means that when spectra are computed 

with digital samples the frequency axis must be labeled in accordance 

with the location of the original signal spectrum. This location is a 

function of the sampling band number K and sample rate f . For odd- 

s 

numbered bands, the frequency axis should be labeled from (K-l)f /2 to 

s 

kf g /2 as shown in figure 2.9a. For even-numbered bands, the scale runs 

from Kf to Kf /2 (figure 2.9b). Note that in the even-numbered bands, it 
s s 

is desirable to rotate the plot about the vertical axis and translate the 

frequency axis such that it runs from Kf / 2 to Kf . 

s s 
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Figure 2.9. Relabeling of frequency axis of computed 
spectra 
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2 . Dynamic Range and Quantization Error 



The dynamic range of an analog-to-digital (A/D) converter can 
be defined as the ratio between the maximum and minimum peak signal 
values or as the signal-to-quantization noise ratio, depending on the 
usage of the digital samples. The maximum peak signal range in db 
(figure 2.10) is given by 



20 < 2 - is > 

max' 

= 6 . 02 (n-1) (2.19) 

where V is the maximum input voltage level and n is the number of 
max 

bits (including sign) of the bipolar A/D converter. For example, a 12- 

bit bipolar A/D converter has a peak signal range of about 66 db, which 

is compatable with most analog instrumentation devices, e.g., tape re- 
corders . 

Quantization noise is caused by the stepwise approximation of 
the digital samples to the original continuous signal and can be treated 
as an additive noise component (figure 2.10). The RMS quantization 
noise is given by 

'V 

N = 0.29 (2.20) 

Q 2 n-l 

The RMS signal-to-quantization noise ratio (assuming a sine wave 
of peak amplitude v max ) in db is given by 



SN n = 20 log n V max / ^ 

H - LU n _l 

.29 V /2 

max 

2 n_ l 

20 l°g 10 .4! 

= 20 log 10 2 n_1 + 7.74 



( 2 . 21 ) 



( 2 . 22 ) 



D + 8 
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(2.23) 

(2.24) 
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v/ 
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Figure 2.10. Effects of quantizing an analog signal 
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Therefore, a 12-bit bipolar A/D converter will have a signal-to-quanti- 
zation noise ratio of about 74 db. 

Eqs. 2.19 and 2.24 both yield optimistic numbers, i.e., it is 
usually not possible to sufficiently control the amplitude of the input 
signal to the A/D converter so that the full voltage range is used. 

This means that the dynamic range can be reduced by 3 to 6 db, depending 
on the variability of the input signal. A buffer amplifier prior to 
the A/D converter is usually needed to condition the signal level. 

3. Discrete Fourier Transform (DFT) 

When analyzing signals whose parameters are known within broad 
limits, one of the first operations to be performed is some type of 
spectral analysis. This can be a single analysis, a series of analyses 
to indicate the time dependence of the spectral components, or an aver- 
aged spectral computation to approximate a power spectral density func- 
tion. This section discusses some of the considerations for using 
digital Fourier Transform techniques to analyze sampled signals. 

The DFT computes a finite Fourier series to represent a signal 
over a pre-defined time interval. The time interval L is chosen to 
include as much of the original signal as desired (figure 2.11a). The 
DFT, since it is a representation applicable to periodic signals, treats 
the signal as if it were periodic with period L (figure 2.11b). The 
discontinuities which may appear because of this forced periodicity are 
included in the representation and are of primary concern when performing 
DFT analyses. 

The DFT representation is closely analogous to the frequency 
domain representation for periodic signals (eqs. 2.1 through 2.6); how- 
ever, it is necessary to express the mathematical operations in terms of 
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(a) Signal truncation 




f ' 



Figure 2.11. DFT processing 
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sampled signal values. It will be assumed that the signal to be repre- 
sented or analyzed consists of a set of N equally spaced samples over 
the time interval L as follows: 

s(t i ) = s (i • At ) i = 0,1,2, .. . ,N-1 (2.25) 

where At is the time between samples. The sample spacing is given by 



At 




1 

f 



s 

where f g is the sample frequency, 
signal s(t) over the interval L is 



(2.26) 



The representation of the original 



N-l „ 

s(t) = A + V A cos(27rnAft + 0) ;0<t<L (2.27) 

o n n y — — 

n=l 

where 

Af = 1/L (2.28) 



the fundamental frequency or "bin" spacing. The amplitude and phase 
spectra are 



A A O A n 

A = / a + b 



n 



n n 



A 1 A A 

6 = -Tan (b /a ) 

n n n 



where „ 2 N-l 



a = — y-s t. cos (2unAf t . ) : n 
n N 1 i ’ 

i=l 



1,2,..., N-l 



N-l 



b •= — 7 st. sin(2frnAft . ) ; n 

n N . u . l l ’ 

1=1 



= 1,2,..., N-l 



The DC term A is 
o 



N-l 



= A = ± l s(t.) 

° N i=0 1 



(2.29) 

(2.30) 

(2.31) 

(2.32) 



The terms a^ and b^ are commonly called the real and imaginary Fourier 
coefficients , respectively. 
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When the signal to be analyzed by DFT methods is real, the am- 
plitude and phase spectra have certain symmetrical properties. Mathe- 
matically, the symmetry is given by 



A (N/2)+(i-l/2) A (N/2)-(i-l/2) 5 1 1 > 2 >---> N / 2 



(2.34) 



which states that the amplitude spectrum has even symmetry about the 
point (N-l)/2. The phase spectrum exhibits odd symmetry about the 
point (N-l)/2. i.e. 



, (N/2)-(i-l/2) 0 (N/2)+(i-l/2) 



; i = l,2,...,N/2 



(2.35) 



In terms of frequency, the point (N-l)/2 can be expressed as f g /2, the 
folding frequency. The portion of the computed spectrum between (N-l)/2 
and N-l can be treated as the negative half of the spectrum which would 
normally occupy the band -(N-l)/2 to 0. For real signals, only the values 
between 0 and (N/2)-l must be considered* 

Signal truncation introduces broadband leakage into the computed 
spectra, i.e* seemingly narrowband signals will appear to occupy more 
bandwidth than expected. Severe discontinuities in the pseudoperiodic 
signal may be present, depending on the relationship of the signal fre- 
quencies to the length of the period L (see figure 2.11). This can most 
easily.be illustrated for the case of a sinusoidal signal. If the signal 
frequency f is such that 

f = KAf ; K = 1,2,..., (N/2)-l (2.36) 

(f is harmonically related to Af) the computed amplitude spectrum will 
consist of a single line component (figure 2.12a). If the signal fre- 
quency is offset by the amount Af/2, the spectrum shown in figure 2.12b 
is obtained. The apparent spectral width of this signal depends more on 
the truncation effects of the DFT calculation than on the actual signal 
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(a) Sine wave harmonically related to Af 





(b) Sine wave midway between two DPT frequency bins 



Figure 2.12. DPT amplitude spectra 

t 
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bandwidth. The solution to this problem is to apply a ’’smoothing” 
window to the input data to eliminate or minimize the discontinuities. 
Windowing functions will be discussed in the next section. 

Although the subtleties of the DFT implementations are beyond 
the scope of this discussion, there are some implementation considera- 
tions with which the analyst must be acquainted. Until several years 

ago, spectrum analysis by digital techniques was an extremely time- 

2 

consuming process, i.e., computation times proportional to N , where 
N is the number of points in the transform. 

Presently, most DFT calculations are performed with fast Fourier 
transform (FFT) algorithms which have computation times proportional to 
N log^N (N is assumed to be a power of 2). The FFT has made it possible 
to perform digital spectral analysis calculations in reasonable lengths 
of time, making on-line digital spectral analyzers a reality. However, 
at least in small computer systems, these algorithms are generally im- 
plemented in fixed point arithmetic which is subject to overflow. Thus, 
considerable care must be taken to properly scale the FFT computations, 
especially in situations where the signal energy falls into a small num- 
ber of frequency bins.' 

These overflows are particularly troublesome because they gen- 
erally appear as spurious components at unpredictable frequencies. The 
best way to prevent overflow while still maintaining the dynamic ampli- 
tude range of the calculation if to perform dynamic scaling on inter- 
mediate computed values in the algorithm. All scaling factors should 
be stored in a way that will permit the computed output values to be 
calibrated with respect to the known input voltages. Experience in- 
dicates that this type of scaling, along with appropriate windowing, 
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provides a dynamic range (ratio of the peak spectral component to the 
computational noise floor) of at least 80 db when implemented on a com- 
puter with 16-bit words. 

4 . Window Functions 

In practice it is only possible to obtain finite lengths of 
records. The statistical questions to be considered later stem from the 
fact that it is necessary to estimate the accuracy of various functions 
obtained from finite amounts of data. From the preceding discussion on 
truncation effects and spectral leakage, it should be clear that the 
operation for taking a finite length record is equivalent to multiplying 
the actual signal s(t) by the data window w(t). The DFT actually uses 
samples from the function s f (t), which is given by 

s f (t) = w(t) s (t) (2.37) 

where s(t) is the original signal and w(t) is zero everywhere but on the 

interval 0 to L. In the frequency domain, eq. 2.37 becomes 

s L (f) = w(f) *s(f) (2.38) 

oo 

= | S (g)W(f-g) dg (2.39) 

_oo 

where * denotes convolution. This means the actual spectrum of s(t) 
is convolved with the Fourier Transform of the window function w(t) . 

The DFT, in effect, samples this result at integral multiples of 1/L. 
Therefore, the shape of the window function’s spectrum determines the 
degree of spectral spreading when a narrowband signal is truncated. 

Any reasonable data window w(t) will produce a spectral window 
W(f) which is concentrated about f=0 but with side lobes which damp out 
as f gets further away from zero. For small L, S (f) may give a very 

ti 

distorted picture of S(f) since the window W(f-g) will be wide and hence 
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values of S(g) far removed from g = f will contribute to S (f) in the 

J-i 

integral. As L becomes large the distortion will be reduced. Finally, 
as L tends to infinity, the transform component at frequency f can be 
fully determined, since data windows will tend to the generalized 
function 6 as L tends to infinity. Hence, as L tends to infinity, W(f-g) 
tends to a delta function centered at g = f and so S (f) tends to S(f). 

j_i 

In reality, of course, this ideal situation is impossible since we are 
always dealing with finite-length records. 

Although there are many different window functions in the lit- 
erature, the advantages of windowing the data may be adequately observed 
by noting the differences between the rectangular window (i.e. simply 
truncating the data) and the Hanning (Cosine) window. It should be 
noted, however, that the application of any data window is simply a 
tradeoff between the bandwidth of the main lobe and the attenuation of 
the side lobes (i.e. a spectral window with a narrow main lobe will have 
as slower rate of attenuation of the side lobes than a window with a 
wider main lobe) . This comparison can be seen for the rectangular 
window and the Hanning window in the next two sections, 
a. Rectangular Window 

The rectangular window is applicable when samples of the 
original signal are used directly by the DFT. This function is given by 



W(t) 



r 1 0 £ t £ L 

^0 elsewhere 



(2.40) 



(see figure 2.13a). The Fourier transform of w(t) is 



W(f) = Le 



i27TfL sin rfL 



(2.41) 



(see figure 2.13b and c) . The actual location of the sin x/x function, 
and hence the amounts of energy leakage, in relation to the DFT frequency 
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(a) Time domain 




(t>) Frequency domain with input signal harmonically 
related to l/L. 




between two bins. 



Figure 2 . 13 . Rectangular window 
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Figure 2 # 14* Rectangular Window 



bins, depends on the frequency of the input signal to which the window is 
applied. For an input signal frequency which is harmonically related to 
Af, the sin x/x function is sampled as shown in figure 2.13b and all sig- 
nal energy plus any other energy which might be present (noise) under the 
main and side lobes is reported as being present at the input signal fre- 
quency. For a signal frequency halfway between the two bins, the sin x/x 
function is sampled as shown in figure 2.13c and the signal energy is at- 
tenuated and spread throughout all the side-lobes giving a very distorted 
picture of the actual signal. The relative size of the side-lobe area 
compared with the main lobe area is very important when considering the 
amount of energy leakage. Figure 2.14 shows the spectrum of the rectan- 
gular-time window drawn to scale, 
b . Hanning Window 

The Hanning Window is adequate for general-purpose spectral 
analysis and is easily applied to data either in the time or frequency 
domain . 

The Hanning Window is defined by 



W(t) = { 



1/2(1 + cos 



-L/2 < t < L/2 
Elsewhere 



(2.42) 



(see figure 2.15). The Fourier Transform of W(t) is 



W(f) 



L J sinirf L 1 sin(TrfL-Tr) + 1 sin(Trfh+Tr) 



L / si 
” 2 \ TT 



fL 



2 (7rfL-7r) 



2 (TTfL+TT) 



■-> 



(2.43) 



L simrfL 
2 irfL(l-f Z L Z ) 



From equation 2.43 it is seen that the Hanning Window turns 
out to be the summation of three sin x/x functions. Since the main lobes 
are spaced 1/L Hz apart, this window has the effect of averaging three 
adjacent lobes, giving the center one twice as much weight as the side 
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ones. Thus, there will be a tendency to greatly decrease the size of 
the side lobes. The main lobe is reduced to one-half of its previous 
value and its width is doubled. The reduction in leakage, therefore, 
results in a corresponding widening of the bandwidth of analysis. The 
distance between the first zero crossing on either side of the main lobe 
is 2/L Hz. Figure 2.15 shows the spectrum of the Hanning window drawn 
to scale. 

C. PROCESSING OF REAL DATA 

To see how some of the ideas presented thus far are applied to real 
data, consider the method of processing being used in the new "SPOTLIGHT" 
acoustic signal processor being developed by Prof. G. A. Rahe at the 
Naval Postgraduate School, Monterey. 

The idea behind the processor is to present the observer with a 
continually updated history of the spectrum of the incoming signal and 
to allow him to interpret the data appropriately. The display of the 
spectrum is presented as a three-dimensional random surface on a CRT 
screen as shown in figure 2.16. Each successive update of the display 
represents an estimate of the incoming signal’s spectrum over a length 
of time T seconds long. As each new estimate is presented, the oldest 
one is dropped off the back of the display and a new one appears on 
the front end. Each spectral estimate is formed as follows: 

1. A data window w of duration L seconds is applied successively 

to the available data in the overlapping intervals (0,L), (S,S+L),... 

[(P-1)S,T], where S is the amount of shift each adjacent data window 
undergoes, and P is the total number of pieces or segments employed 

(see figure 2.17). 

2. A DFT is performed on each of the P windowed sections using 
an FFT algorithm and a raw estimate is computed as the magnitude squared 



of the DFT. 
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|G(f)l 2 




Figure 2.16. "SPOTLIGHT" processor display 




Figure 2.17. Overlapped data windows 
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3. The smoothed estimate is computed as the average of the P 
raw estiamtes. 

Since L seconds of data are used for each estimate we see that 

(P-l)S + L = T (2.44) 

Each estimate need not and most probably will not be independent of the 
previous estimate. The update rate of the estimates is a function of 
the operator’s needs. The choice of the parameters L, T, and S is dis- 
cussed in the next section. 

D. PRACTICAL CONSIDERATIONS 

The effectiveness of the processor depends on its ability to make 
consistently stable spectral estimated using intelligently chosen para- 
maters. Some of the more practical aspects of the spectral estimate 
are now discussed. The parameters associated with making the estimates 
must necessarily be chosen prior to the processing of any data and should 
be made to meet certain specifications. 

The sampling rate must be chosen so that the spectrum can be esti- 
mated in the frequency range of interest, 0 < f < f^. Hence, the sam- 
pling rate must be at least 2f^. Care must be taken to avoid aliasing 
as was discussed in section IIB. This is usually done with an anti- 
aliasing filter prior to sampling the signal so that the power above 
f^in the spectrum is effectively removed. 

Associated with reproducing the fine details of the spectrum is the 
fidelity o f the estimate. To achieve high fidelity, the ’’Bandwidth’ 1 of 
the window used must be of the same order as the width of the narrowest 
important detail in the spectrum. It is, therefore, useful to be able 
to make an intelligent estimate as to the width of the smallest impor- 
tant detail required by the system to be ’'detected”. For example, if 
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the width of the narrowest important detail in the spectrum is known to 
be .15 Hz, then as spectral window whose "bandwidth" is of the order of 
.15 Hz or smaller must be used to be able to resolve the fine details 
of the spectrum. Note that to resolve a spectrum to the same amount of 
detail, more data (a larger transform) would be required if using the 
Hanning window than if using the rectangular window since the "bandwidth" 
of the Hanning window is greater than that for the rectangular window 
(see figures 2.14 and 2.15). Assuming that the hardware is available, 
one should not be lulled into ignoring the above considerations and 
using the finest resolution available, because the requirements for high 
fidelity and high stability (to be discussed next) are conflicting and 
any practical realization is necessarily a compromise between the two. 

It seems appropriate at this time to say that the frequency "band- 
width" of a narrow band-pass filter (window) is described in several 
different ways. Three of the most common descriptions for bandwidth 
which are of interest to the power spectral density measurement are 
briefly described here. 

1. The half-power point bandwidth is defined as 



where 




(2.45) 



is the frequency interval between the upper and lower frequencies 
where the filter attenuates an applied signal 3 db below maximum trans- 
missibility. This description of bandwidth is convenient because it is 
easy to measure, but is otherwise of little physical significance for 
random signal applications. 
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2. The noise bandwidth B is defined as 

n 

J |w(f)| 2 df 

R = lil- 




(2.46) 



is the bandwidth of a hypothetical rectangular filter which would pass a 
signal with the same mean square value as the actual filter when the in- 
put is white noise. B is a convenient measure of bandwidth to use for 
r n 

normalizing narrowband mean square value measurements, as required for 
analog power spectral density measurements. 

3. The equivalent statistical bandwidth B^ defined as 



B 

s 



J n |w(f)| 2 df 

| |w(f) 1 4 df 

0 



(2.47) 



is the bandwidth of a hypothetical rectangular filter which would pass 
a signal with the same mean square value statistical error as the actual 
filter when the input is white noise. In other words, B^ is the descrip- 
tion of filter bandwidth which is technically appropriate when considering 
the statistical error e of a properly resolved power-spectral density 
measurement as given by 



(2.48) 

/b t 



e 

For the case of an ideal filter, the three descriptions of bandwidth 
are all equal. For any other filter characteristic, however, the band- 
width terms will generally be different. A comparison of these three 
definitions of bandwidth for the rectangular and Hanning data windows 
is seen below in table 2.1. 
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Data Window 


B B B 

r n s 


Rectangular 


.85 1 1.23 

L L L 




1.4 1.5 2.08 


Hanning 


L L L 



Table 2.1. Comparison of Bandwidth Definitions 



For finite records, the extent to which the width of peaks can be 
estimated and fine detain detected is also influenced by the variance 
of the estimate. Hence, to be able to trust the fine structure of the 
spectrum, it must be possible to tie down the estimate to a given 
stability. The estimator is said to have high stability if the variance 
of the smoothed spectral estimate is small. A convenient description of 
a spectral estimate is given by its "equivalent number of degrees of 
freedom", K, defined (Blackman and Tukey, 1968) as 



K = 2 (Average)' 
Variance 



(2.49) 



Since a small variance is associated with a stable estimate, the larger 
the equivalent number of degrees of freedom the more stable the estimate. 
It will be shown in section III that maximum stability is obtained by 
overlapping the data windows as shown in figure 2.17. The amount of 
overlap used to achieve the maximum stability is dependent upon the par- 
ticular data window used. For a fixed transform length L, windows with 
lower sidelobes have wider and smoother main lobes and therefore require 
less overlap to achieve maximum stability. 

In addition to the fidelity and stability of the estimated spectrum, 
there are some other important considerations to be made when choosing 
the parameters to be used in making the estimates. For instance, the 
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statistics of the background must be a factor in determining the length 
of time, T, used in the smooth estimates. A particular advantage to 
this method of obtaining the spectral estimates is that L may be chosen 
so that slow variations in the background do not affect the performance 
of the processor. If necessary, the frequency resolution of the esti- 
mate may be adjusted by appending zeros to the data before taking the 
transforms . 

Additionally, if the desired signal is unstable - i.e., fading in 
and out due to the effects of the propagation medium, propagation paths, 
or just being transmitted intermittently - and therefore not present for 
at least lengths of time equal to T, then it is possible that the inte- 
gration process will actually attenuate the signal rather than enhance 
it. Redundancy is the display will help alleviate the intermittent 
transmission problem by providing the operator with estimates which are 
not totally independent, but fade-in and fade-out problems due to propa- 
gation should be considered in choosing the integration time T. 
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III. THE SPECTRAL ESTIMATE 



In the development of modern communication theory, a vast body of 
knowledge involving various technical disciplines has been incorporated. 
The area of digital data analysis and measurement, for example, includes 
the widespread use of spectral analysis techniques made feasible by the 
availability of the fast Fourier transform algorithm. Increased emphasis 
on optimization and detection techniques also necessitates the use of 
modern statistical procedures. The very foundation of communication 
engineering includes such topics as probability theory, random processes, 
and information theory. This section will draw on what is necessary from 
the theoretical framework of modern communication theory to discuss the 
problem of estimating the power spectra of finite length records and 
determining their confidence level. 

A. STATIONARY STOCHASTIC PROCESSES AND ERGODICITY 

A process is termed deterministic if it does not contain any features 
of randomness; otherwise it is termed stochastic (or random). Spectral 
analysis is concerned mainly with a class of stochastic processes called 
stationary stochastic processes. A stationary stochastic process is one 
whose statistical properties do not change with time. Signals generated 
by a stationary process have the property that a long enough segment of 
any signal recorded in some time period has essentially the same statis- 
tics as another segment of the signal observed at some other period. 

That is, a signal from a stationary process is not tied down to any par- 
ticular origin in time. 

There are several definitions of stationarity in current use. A 
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process is said to be stationary in the strict sense if none of the 
statistics which characterize the stochastic process changes with time. 
Another definition of stationarity is wide-sense or weak stationarity . 

A process is said to be weakly stationary if the mean value and auto- 
correlation function do not vary with time. In other words, for the 
weakly stationary random process, the mean value is a constant and the 
auto-correlation function is dependent only upon the displacement t and 
not on the particular time chosen to compute the auto-correlation. 

If each signal of the ensemble of possible signals that can be gen- 
erated by a stationary process is typical of the ensemble as a whole, 
then the process is said to be ergodic. The statistics over a long time 
interval for any one signal are then the same as the statistics over the 
ensemble of all possible signals at any one time instant. Thus an ergod- 
ic process is one for which corresponding ensemble averages and time 
averages are equal. 

The class of ergodic processes is a subclass of the class of station- 
ary processes. That is, an ergodic process is necessarily stationary but 
a stationary process need not be ergodic. The application of theory to 
practice is very oftep simplified by making the assumption that a pro- 
cess is ergodic; however, except in special instances it would be diffi- 
cult or impossible to demonstrate that a physical process is ergodic 
other than by direct measurements and comparisons. 

B. DEFINITION 

The following discussion will examine the definition of the spectral 
estimate, the error involved in trying to make a spectral estimate from a 
finite length sample, and ways of improving this inherent error. To avoid 
confusion, the terms "spectral estimate' 1 or "estimate" in this discussion 
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refer consistently to the estimate of each individual spectral component 
within the bandwidth of interest, B; and the term "spectrum" will be used 
to refer to all the individual estimates collectively. 

In order to proceed with estimating the spectrum of ocean ambient 
noise, some assumptions about the properties of the noise will have to be 
made. It will be assumed in this discussion that the ocean ambient noise 
may be considered approximately Gaussian and at least weakly stationary. 
Some recent research has shown that these assumptions are reasonable 
(Parkins, 1968). 

In the context of digital signal processing, the power spectral den- 
sity or power spectrum of a signal is defined as the magnitude squared of 
the Fourier transform of the signal. Inherent in the definition of the 
Fourier transform, however, is the fact that a signal of finite length 
will have a transform which contains all frequencies and, conversely, 
a band-limited spectrum emanates from a signal which extends from t = - 00 
to t = + oo , Therefore, in order to talk about a finite spectrum, the 
definition of the true power spectrum of a record n(t) is defined as 

G(f) = lim h.\ |N(f,T).| Z ] (3.1) 

T-K» L J 

where N(f,T) is the finite Fourier transform of n(t), that is, 

T _.2nft 

N(f,T) = f n (t) e J dt (3.2) 

0 

An estimate of the true spectrum given in Equation 3.1 for a sample 
record of any length T may be obtained by simply omitting the limiting 
and expectation operators. This will yield an estimated spectrum of 

G(f,T) = | |N(f ,T) | 2 (3.3) 
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where the frequency resolution will be — . This estimate, as it stands, 
will be shown to be statistically unreliable, however. Methods of improv- 
ing the reliability will be presented in Section IIIE. 



C. STATISTICS OF THE SPECTRAL ESTIMATE 

As we discussed in Section II, N(f,T) is a complex quantity composed 

2 

of the real and imaginary Fourier coefficients and | N ( f , T) | is the sum 
of the squares of the Fourier coefficients. Now, since n(t) is consider- 
ed to be a Gaussian random process with zero mean, and since the Fourier 
transform is a linear operation, it can easily be shown (see Section IV) 
that the Fourier coefficients are independent Gaussian random variables 
with zero mean and equal variances. 

A 

Blackman and Tukey (Pg 21) show that each frequency estimate of G(f) 
is a chi-square variable with two ’’degrees of freedom”. The number of 
degrees of freedom, K, may be thought of as representing the number of 
independent or "free” squares entering into the estimate. Note that Equa- 
tion 3.3 describes the estimated spectrum and that each spectral compon- 
ent is a chi-square variable with two degrees of freedom. The probability 
density function of is given by 




( x ) - ( 2 k ' 2 rCk/2)] - 1 x ^ 2 - 1 ) e -*'' 2 



<3.«) 



where T(a) = x 3 1 e X dx (3.5) 

The mean and variance of the variable are 

E lX k 2] ' \2 " k <3 ' 6> 

Var [ X k 2 ] = o2 v 2 = 2k (3.7) 

X k 
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Note that the variance of the spectral estimate is independent of the 
record length T. That is, increasing the record length does not reduce 
the variance of the estimate; it only increases the number of spectral 
components in the spectrum. Since the record length is a measure of the 
sample size for the estimate, this implies that the PSD estimates given 
by Equation 3.3 are inconsistent . To be consistent, an estimate should 
approach the parameter being estimated with a probability approaching 
unity as the sample size becomes large. That is, for any e >0, 

lira Prob [ |G - G| > e] = 0 (3.8) 

T-*» 

Because the estimates given by Equation 3.3 are inconsistent, the 
spectrum computed from that equation is called the "raw spectrum" and the 
components of the spectrum are called the "raw" components. It will be 
shown that more nearly consistent estimates can be made by ’'smoothing 1 ’ 
the raw estimates. 

D. ERROR ASSOCIATED WITH THE SPECTRAL ESTIMATE 

The accuracy of parameter estimates based upon sample values is usu- 
ally described in terms of the mean square error defined by 

* 2 

Mean square error = E [ (G - G) ] (3.9) 

It can easily be shown that the mean square error is made up of two 
parts; the variance of the estimate (random error) and the square of the 
bias error of the estimate, that is, 

Mean square error = Var [G] + b^[G] (3.10) 

An estimate is said to be unbiased if E[G] = G. Therefore, the bias 
error of the estimate is defined as 

Bias error = b [G] = E[G] - G (3.11) 
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In Bendot and Piersol (Pg 187), the bias error for the spectral esti- 
mate is shown to be proportional to the second derivative of the true 
spectrum. Therefore, for a relatively smooth spectrum over the effective 
bandwidth of each estimate, the bias error is usually negligible com- 
pared with the random portion of the error, VAR[G] . The mean square 
error is thus approximately equal to the square of the standard error 
where the standard error is defined as 

Standard error = • VAR[G] = o[G] (3,12) 



If this error is expressed as a fraction of the quantity being esti- 
mated, it is called the normalized standard error, and is given by 



_ . a [G] 

r G 



(3.13) 



Remembering that 

E [X k 2 ]= E[G] = k « G 



and 



VAR [X k 2 ] = VAR [G] = 2k 



we have 




(3.14) 



(3.15) 



(3.16) 



And since K = 2 for the raw spectral estimate, we have a standard error 
of unity, which says that the standard deviation of the estimate is as 
large as the quantity being estimated — which is obviously unacceptable. 
The smoothing operations presented in the next section will increase the 
number of degrees of freedom associated with each estimate and thus de- 
crease the standard error considerably. 



E. SMOOTHING THE SPECTRAL ESTIMATE 

The method of reducing this error is called "smoothing" and may be 
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accomplished in one of two ways. The first way is to smooth over an 
ensemble of estimates. This can be done by computing individual esti- 
mates from q sample records and then averaging the q estimates at each 
frequency. The second way is to smooth over frequency. This can be done 
by averaging together the results of l contiguous spectral components in 

the estimate from a single sample record. In either case, the smoothing 

/ 

operation approximates the expectation operation in equation 3.1 and 
hence makes the estimates consistent. 

For ensemble smoothing, the estimates from q independent (i.e., non- 
overlapping) time slices are averaged together. If each time slice is of 
length T 1 , then the length of the original sample is T = qT* and the final 
smoothed estimate is given by 



A 

G. 

1 





+ G. ] 

i>q 



(3.17) 



where G. is the raw estimate at frequency f. of the q time slice. 
i,q 1 

Noting that x a 2 + X^ 2 = X a+ j 2 we see that the smoothed estimate will be a 
chi-squared variable with roughly 2q degrees of freedom. The smoothing 
operation did not affect the bandwidth of the estimates; hence each esti- 

. (X 

mate will have an effective bandwidth of = — , where a depends on the 
particular window being used. Hence, by ensemble averaging 



R = SL = «SL 

e T 1 T 



and 



f l 

q 



(3.18) 



(3.19) 



The filter shape of each smooth estimate will be the same as for the 
raw estimate which is determined by the Fourier transform of the original 
data window used. The estimate (L is considered as representing the 
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midpoint of the frequency interval covered by the filter (B is the effec- 
tive bandwidth of the filter). Figure 3.1 illustrates the process of ob- 
taining smooth spectral estimates using ensemble averaging. 

For frequency smoothing, the estimates of Z contiguous spectral com- 
ponents from the raw spectrum are averaged to form the smooth estimate 

(the raw spectrum referred to was computed from a time record of length 
th A 

T) . Hence, the i spectral component of the smoothed spectrum will 
be given by 



G. 

x 



Z [ G 2i+l-£ + ' 

1 2 



+ G 2i~l+fc j 



(3.20) 



where G_. is the i component of the raw estimate and it is assumed that 

A 

Z is odd. G^ will be a chi-squared variable with roughly 2Z degrees of 
freedom by the same reasoning used with ensemble smoothing. The effec- 
tive bandwidth, B 1 ^, of the smoothed estimate will depend on the value of 
l and will be given by 



B 1 = 

e 



Z - 1 -Fa 



(3.21) 



where the effective bandwidth of each raw estimate used in the average 
was = a/T. Again the value of a depends on the data window used. The 
filter shape of the smoothed estimate will be approximately trapezoidal 
as shown in figure 3.2. The process of obtaining a smooth estimate 
through frequency averaging is shown in figure 3.3. The estimate G^ is 
considered as representing the midpoint of the frequency interval covered 
by B^. A total of N / Z independent estimates can be obtained in the 
smoothed spectrum. The standard error of the estimates obtained by fre- 
quency averaging is given by 



e 

r 
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(a) Sample record segmented into three pieces 



G, 



“ H 



(b) Raw spectrum from segment one 









(e) Smoothed spectrum by ensemble smoothing with q=3 



\ 



Figure 3.1. Example of ensemble smoothing 
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For combined smoothing using both frequency and ensemble averaging, 
the final effective bandwidth of each estimate will be approximately 




and the resulting will be a chi-squared variable with roughly 2lq 
degrees of freedom. The normalized standard error e^ is given by 



e 

r 





(3.24) 



Even though the stability of the estimates may be improved by either 
ensemble or frequency smoothing or a combination of the two, it is advis- 
able to use only ensemble smoothing in a system designed to detect narrow- 
band signals. The frequency resolution of the estimated spectrum will 
generally be determined to be of the same order of magnitude as the sig- 
nal to be detected, and taking longer transforms and frequency smoothing 
would be a very inefficient use of the available resources. 



F. INCREASED STABILITY THROUGH OVERLAPPING SEGMENTS 

The term "degrees of freedom” introduced in Section IIIC as a measure 
of the stability of the estimate was said to refer to the number of inde- 
pendent or "free” squares that were summed to form the estimate. A 
slightly relaxed definition of the number of degrees of freedom of an 
estimate, called the "equivalent number of degrees of freedom", can be 
used by noting that for a chi-squared variable 



2E 2 [G] 
* Var [G] 



(3.25) 



This definition of K will be used to see if the stability of the estimates 
may be increased still further by overlapping the segments used in the 
ensemble smoothing operation. 
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(a) Filter shape before frequency smoothing 




(b) Filter shape after frequency smoothing 



Figure 3.2. Filter shape before and after freq. smoothing 
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n(t ) 




(a) Sample record used for the spectral analysis 



G(f) 






(b) Raw spectrum from transform of n(t) between 
t = 0 and t = T 




(c) Smoothed estimate by frequency smoothing with J{=3 



Figure 3.3. Example of frequency smoothing 
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Referring to the notation of section 2.2.5 and figure 2.17, we see 



that each smooth estimate is computed by 



- i r ~ 2 

G(f ) = £ l |G (f)| (3.26) 

i=l 

where G_^(f) is the finite Fourier Transform of the ith windowed section 
and P is the number of segments used. That is 

G.(f) = | n (t) W[t - | - (i-l)s)e' j2irft dt, (3.27) 

•^+(i-l)s 1 _< i <_ P 

The spectral estimate G(f) in equation 3.26 is a random variable. 

Its mean and variance were computed by Nut tall (1971) and are given by 



where 



- f 2 (2 

E[G(f ) ] = G(f - v) |w(v) | dv - G(f ) |W(v) | dv 

—00 —00 

(P-1) 2 

Var [G(f ) ] = G 2 (f) £ (1 - -^-) | ^(is) | 

i=-(P-l) 

oo 

4) (x) = [ W(t) W(t - T ) dt 



(3.28) 



(3.29) 



(3.30) 



under the assumption that n(t) is a Gaussian random process and that the 

2 

frequency resolution of the spectral window |w(f) | is narrower than the 
finest detail in the true spectrum G(f). 

Equation 3.28 shows that the mean of the spectral estimate is equal 
to the convolution of the true spectrum G(f )* with the spectral window 
|W(f) | . Equation 3.29 expresses the variance of the spectral estimate 
in terms of the number of pieces P, the shift S, and the autocorrelation 
<f> w of the data window. Using the results given in equations 3.28 and 
3.29, we can compute the equivalent number of degrees of freedom of our 
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estimate in terms of the number of pieces used and the autocorrelation 
of the data window as follows: 



- 2E 2 [g(f) ] 
Var [6(f)] 



(3.31) 



2P [ j |w(v) | 2 d v J 

= p -l I . | y 

l a-^bw^l 

i=-(p-D F w 



But 



<f> (0) = 

w 



W (t)dt 



P-l 

l (1- 

i— (P-l) 



2P 




<f> (iS) 
w 

(J) (0) 
w 



2 



(3. 32) 



(3.33) 



(3.34) 



Notice that K. is independent of the value of frequency f and true spec- 
trum G(f). It depends only on the particular data window W and the num- 
ber of pieces P. Noting that the shift S is uniquely determined as a 
function of P,L, and T, as shown in equation 2.44, we have 



K = 



2P 

P-l 

i + 2-i a - b 

i=l 



■ , . T-L. 
*w (l - P=l> 



<t> (0) 
Y w 



(3.35) 



It is interesting to note that for a given T/L, a maximum value of 
K occurs at some value of overlap less than 100 percent. The value of 
overlap is a function of the particular data window used and is relatively 
independent of the value of T/L. As described in section IIC, for 
P = T/L, K = 2P. As P increases for a given T/L, the value of K will 
increase but at a continually decreasing rate because the overlapped 
pieces are progressively more statistically dependent. K will eventually 
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reach a maximum value and then decrease slightly as the amount of overlap 
approaches 100 percent. A typical curve of K versus percent overlap is 
shown in figure 3.4. Note that the amount of overlap required to achieve 
90 or 95 percent of maximum stability is considerably less than that re- 
quired for 100 percent. The extra computational effort required for 
highly overlapped processing should be given serious consideration before 
implementing the maximum value of K. 

The rectangular data window is described by 



W(t) = u ( t ) - u(t-L) 
where u(t) is a step function. 

The autocorrelation function as defined in equation 3.30 is computed 
to be 



and 



<j>(x) = L - | T j 

<K~0 _ , jr_L 

<K0) L 



(|t| _< l) 
( M £ L) 



For the Hanning window 



W(t) = -|(1 - cos [u(t) - u(t - L)] 



thus 



, , v 1 /T v ,, , 1 2ttt x , 3 L , 2ttt 

4 >(t) = _(L-t) (1+ -cos sin— 



( | T | £ L) 



and 



(j> ( t) _ 2. w . 1 2 ttt. 1 . 2ttt 

(f)(0) 3L L t)(1+ 2 COS L ^ + 27 s111 L 



( I T < L) 



(3.36) 

(3.37) 



(3.38) 

(3.39) 



(3.40) 



Tables 3.1 - 3.4 show how K varies as a function of P for several 
values of T/L, as given by equation 3.35, for the rectangular and Hanning 
windows . 
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Of , K 




Figure 3.4. Typical K vs S curve 



*- S 
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p 

4 

5 

6 

7 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

50 

100 

200 

300 

400 

500 



RECTANGULAR HANNING 



S(%) 


K 


K 


0 


8.0 


8.0 


25 


9.09 


9.99 


40 


9.47 


11.91 


50 


9.80 


13.36 


57.1 


9.99 


14.09 


62.5 


10.05 


14.29 


66.7 


10.11 


14.28 


70.0 


10.15 


14.20 


72.7 


10.15 


14.13 


75.0 


10.16 


14.06 


76.9 


10.17 


14.00 


78.6 


10.16 


13.94 


80.0 


10.15 


13.90 


81.2 


10.15 


13.85 


82.4 


10.14 


13.81 


83.3 


10.14 


13.78 


84.2 


10.13 


13.75 


93.9 


9.98 


13.38 


97.0 


9.90 


13.26 


98.5 


9.86 


13.20 


. 99.0 


9.85 


13.18 


99.2 


9.84 


13.17 


99.4 


9.83 


13.16 


TABLE 3.1 


K vs P FOR T/L = 4 
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p 

8 

9 

10 

11 

12 

13 

14 

15 

16 

17 

18 

19 

20 

21 

22 

23 

24 

25 

50 

100 

200 

300 

400 

500 



RECTANGULAR HANNING 



S (%) K K 



0 


16.0 


16.0 


12.5 


17.51 


18.0 


22.2 


18.37 


20.0 


30.0 


18.91 


21.99 


36.4 


19.32 


23.92 


41.7 


19.69 


25.71 


46.2 


20.06 


27.27 


50.0 


20.46 


28.52 


53.3 


20.76 


29.45 


56.2 


20.95 


30.09 


58.8 


21.06 


30.49 


61.1 


21.16 


30.72 


63.2 


21.25 


30.83 


65.0 


21.35 


30.87 


66.7 


21.46 


30.87 


68.2 


21.55 


30.85 


69.6 


21.61 


30.82 


70.8 


21.64 


30.79 


85.7 


21.95 


30.30 


92.9 


21.93 


30.03 


96.5 


21.87 


29.89 


97.7 


21.84 


29.84 


98.2 


21.83 


29.81 


98.6 


21.82 


29.80 



TABLE 3.2 K vs P FOR T/L = 8 
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p 

16 

17 

18 

19 

20 

25 

30 

35 

40 

45 

46 

47 

48 

49 

50 

100 

200 

300 

400 

500 



RECTANGULAR HANNING 



S(%) 


K 


K 


0 


32.0 


32.0 


6.2 


33.75 


34.0 


11.8 


35.08 


36.0 


16.7 


36.10 


38.0 


21.1 


36.89 


40.0 


37.5 


39.37 


49.77 


48.3 


41.36 


57.67 


55.9 


42.87 


62.07 


61.5 


43.49 


63.70 


65.9 


44.05 


64.06 


66.7 


44.19 


64.070 


67.4 


44.31 


64.074 


68.1 


44.40 


64.073 


68.7 


44.48 


64.067 


69.4 


44.55 


64.06 


84.8 


45.66 


63.59 


92.5' 


45.85 


63.30 


95.0 


45.85 


63.20 


96.2 


45.84 


63.15 


97.0 


45.83 


63.12 



TABLE 3.3 K vs P FOR T/L = 16 
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p 

32 

33 

34 

35 

36 

37 

38 

39 

40 

50 

60 

70 

80 

90 

100 

101 

102 

103 

104 

105 

110 

120 

130 

140 

150 

200 

300 

400 

500 



RECTANGULAR HANNING 



S(%) 


K 


K 


0 


64.0 


64.0 


3.1 


65.88 


66.0 


6.1 


67.52 


68.0 


8.8 


68.96 


70.0 


11.4 


70.22 


72.0 


13.9 


71.32 


74.0 


16.2 


72.30 


76.0 


18.4 


73.16 


78.0 


20.5 


73.93 


80.0 


36.7 


79.08 


99.61 


47.5 


83.16 


115.85 


55.1 


86.53 


125.43 


60.8 


87.94 


129.31 


65.2 


89.11 


130.36 


68.7 


90.29 


130.52 


69.0 


•90.37 


130.53 


69.3 


90.43 


130.53 


69.6 


90.49 


130.52 


69.9 


90.55 


130.52 


70.2 


90.61 


130.52 


71.6 


90.84 


130.50 


74.2 


91.29 


130.45 


76.0 


91.82 


130.40 


77.7 


92.08 


130.35 


79.2 


92.29 


130.30 


84.4 


93.07 


130.11 


89.6 


93.55 


129.92 


92.2 


93.70 


129.82 


93.8 


93.75 


129.76 



TABLE 3.4 K vs P FOR T/L = 32 
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G. CONFIDENCE INTERVAL FOR THE SPECTRAL ESTIMATE 



Since the sampling distribution of the smoothed estimate is approxi- 
mately chi-square with K degrees of freedom, we have a (1 - a) confidence 
interval for a power spectral density function G(f) (based upon an esti- 
mate G(f) measured with a resolution bandwidth and a record length T) 
given by 



_|k«_ < G(£ , < 2 mn ' 

X K: a/2 X K: 1 - a/2 

1 



(3.41) 



The confidence statement is interpreted as follows: The true PSD G(f) 

will fall within the noted interval with a confidence coefficient of 
(1 - a), or in more common terminology, with a confidence of 100(1 - a) 
percent . 
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IV. PROCESSING GAIN OF A FOURIER PROCESSOR WITH INTEGRATION 



A. INTRODUCTION 

Section III discussed the spectral estimate by the use of the fast 
Fourier transform technique and the method of improving the estimates for 
a given record of finite length. It was shown that the measure of 
quality or reliability of the estimate is the mean square error (or vari- 
ance) and that this error can be made arbitrarily small if a longer' 
record were available. This situation often creates a three-way com- 
promise among effort, resolution and stability (precision) of the 
estimate . 

This section will interpret the stability of the spectral estimates 
in terms of processing gain within the context of a Fourier signal pro- 
cessor. Figure 4.1 shows a model of the system. Its mission is to de- 
tect the presence in a random noise background of discrete narrowband 
signals with random phase variation. The decision element of the syetem 
is assumed to be a human operator monitoring a display similar to the 
one discussed in section II c. The sensitivity of the processor is us- 
ually defined in terms of a minimum detectable signal for a given pro- 
bability of detection and false alarm. Because of the presence of the 
human operator in the system, the level of minimum detectable signal is 
difficult to determine and will not be discussed. A brief look at the 
theory of signal detection by a human operator will be coverd in section 
V. It should be noted that the "processing gain" discussed in this sec- 
tion is derived under rather idealized conditions. The effects of 
redundency and Hanning weighting are considered but losses due to hard- 
ware limitations are not discussed. Losses due to actual implementation 
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Figure 4.1. Fourier processor with integration 



68 



have been found to be small if careful consideration is given to the 
areas discussed in section II. 

B. PROCESSING GAIN DEFINED 

When talking about processing gain of a Fourier signal processor, it 
is convenient to view the DFT as a bank of N independent narrow band 
filters, where N is the number of equally spaced samples (each of width A) 
within the time of L the data to be transformed. The center of each 
"filter 11 will be located at multiples of the fundamental frequency 
f = “ Hz and the shape of each filter will depend upon the data window 
used as previously discussed. Regardless of which window is used, however, 
maximum power transfer will occur only if the desired signal if a multiple 
of the fundamental frequency. If the signal is not an even multiple of 
f Q , then it will be attenuated and its energy spread throughout the 
spectrum (the extent of the leakage will again depend on which window 
is used) as shown in figure 2.12. 

The processing gain (PG) of a digital processor which utilizes the 
DFT as the initial filtering device and then incoherently integrates 
the output of each filter (i.e., ensemble smoothing) will be made up of 
two distinct parts, one part due to the DFT itself (PG r p and one part due 
to the incoherent integration (PG^.) . Heuristically , this makes sense, 
since the signal-to-noise power ratio (SNR) will obviously be improved 
by passing the signal through a narrowband filter. And then the results 
can be improved even more by averaging the random noise power over time 
since the average signal power is approximately constant. Therefore, 
the processing gain for the system is defined as 

PG = PG t • PG ]; (4.1) 
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For simplicity, the processing gain will be derived in terms of a 
single discrete frequency component in an additive white gaussian noise 
background. The assumption of a white noise background is really not 
very restrictive since all that is required is that the power spectrum 
of the noise be flat over the bandwidth of each filter. Further, it 
will be assumed that the signal to be "detected" will be an integer 
multiple of the fundamental frequency, f^= 1/L, so that maximum power 
transfer is achieved by the filter. This is a reasonable assumption 
since we are interested in the maximum gain available from the system. 
Also, it will be assumed that the signal is stable in frequency through- 
out the interval of observation and that it is actually present in the 
background during that time. Exceptions to the above assumptions will 
be discussed later. 

The processing gain of each section of the processor will be de- 
fined as the ratio of the signal- to-noise power at the output of the 
section to the signal-to-noise power at the input of the section. For 

standardization, the noise power used is usually the average power of 

th 

the noise in a 1 Hz band. Thus for the j filter of the DFT (see 
figure 4.2), 



SNR 

pp — 

r T , j SNR X 



( 4 . 2 ) 



and for the integrator 



PG 



SNR, 

4 

I,j SNR 



( 4 . 3 ) 



th 

Notice that the signal-to noise ratio at the output of the j filter 
is the input signal-to-noise ratio to the integrator (i.e., SNR^ = SNR^) . 

Ihere£0re> SNR, SNR. SNR, 

pp . pp ~ . ft = fL = pg 

T,j rto I,j SNR X SNR 3 SNR x j 

the processing gain for the entire system. 
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( 4 . 4 ) 



. snr 1t 




SNR 0 SNR 2 




INPUT 


FFT 


’ yp 


INTEGRATE 



SNR , 



OUTPUT 



Figure 4.2. Reference for signal-to-noise ratios 
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For purposes of the following analysis, only the processing gain of 
a typical filter will be derived. Results for the remaining N-l filters 
of the processor may be generalized from this one by considering whether 
or not a signal is present within the bandwidth of the filter. 



C. PROCESSING GAIN OF THE DFT (PG .) 

y 3 

th 

The j filter output of the DFT is defined as 



c. 

3 



1 

N 



N-l 



I s 

k=0 



-i2irik/N 

k 6 



(4.5) 



where k = 0, 1,2 , . . . ,N-1 are the N sample values of the input signal 

to be transformed. 

th 

With noise only present at the input, the output of the j filter 
may be written as 



c . 
J 



1 

N 



N-l 



l 

k=0 



-i2irjk/N 

n k 6 



(4.6) 



where n^, k = 0 , 1,2 , . . . ,N-1 are sample values from a zero mean Gaussian 
random process. In terms of the real and imaginary coefficients, the 
complex quantity c^ can be written as 

c. = a. + i b . (4.7) 

J J J 



where •• 



N-l 

• a. = Re(c.) = — £ n, cos (2irjk/N) 

3 3 N k=0 k 

N-l 

b. = Im(c . ) =— £ n sin (2irjk/N) 

3 3 N k=0 K 



(4.8a) 

(4.8b) 



Now since the DFT is a linear operator, the random variables a^ and b^ 
are each zero-mean Gaussian random variables when the input signal is 



noise alone. To show that a. and b. are also independent, we need only 

J 3 

to show that E[a.b.] = 0. 

J 3 
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Now 



N-l 

E[a>J = E[(i l 



3 3 



k=0 



N-l 

n cos 27rjk/N) (- - l n 

k M Jj-Q 



sin 2uj £/N) ] 



(A. 9) 



N-l N-l 

= - - l l E[n n ] cos(2Trjk/N) sin (27rj £ /N) (4.10) 

N k=0 £=0 



= - — 9 l E[n, 2 ] cos (2rjk/N) sin(2iTjk/N) (4.11) 

N k=0 

2 N-l 

~ ~ ~~n l sin (4iTjk/N) . (4.12) 

N k=0 



Efa^bJ = 0 (4.13) 



where a is the variance (average power) of the input noise signal. 
Therefore, a^ and b^. are independent zero-mean Gaussian random variables. 
They also have equal variances as shown below: 



Var [a. ] 
1 



And 

Var [b^ ] 



2 , N-l N-l 

E [a ] = E[(- l cos 2irjk/N) (£ £ n cos 27rj£/N) ] 

2 W k=0 k W SL=0 * 

N-l N-l 

= E t _ o l l n.n cos (2irjk/N) cos (2 ttj£/N)] 

N k=0 £=0 

1 N-l ‘ 2 2 

= — O l Efn ] cos (2iTjk/N) 

N k=0 

2 N-l 

= -^2 1 1 + cos (4irjk/N) 

2N k=0 



2 

a 

2N 



2 N-l N-l 

E [b J = E [ (— l n sin 2TTjk/N) (- £ n sin 2nj£/N)] 

J W k=0 w 1=0 * 



(4.14) 

(4.15) 

(4.16) 

(4.17) 

(4.18) 

(4.19) 
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(4.20) 



N-i 2 2 

= - 9 I Efn ] sin (2irjk/N) 
N k=0 

2 N-l 

= -^2 1 1 ~ cos (2irjk/N) 

2N k=0 



2 

a 

2N 



(4.21) 



(4.22) 



ttl 

Now the power spectral component of the j filter is defined as 

(4.23) 



A 2 2 
G. = a. + b. 

3 3 J 



and the average power in that bin is therefore given by 

Avg Pwr = E[G.] = E[a. 2 ] + E[b. 2 ] 

3 3 3 



(4.24) 



o 

N 



(4.25) 



Now if a signal is present, the sample values s , k = 0,1,2, ... ,N-1 

K. 

are each the sum of a zero mean Gaussian random variable and a constant 

t h 

A cos (2irfkA + <j>). So the j filter output can be written as 

N-l 



C . = tt £ [n, + A cos (2irfkA + <{>)]e 

2 N k=0 k 



-i2-njk/N 



(4.26) 



1 N r‘*‘ -i2irjk/N 1 -i2irjk/N 

e J + — }, A cos (2ufkA + <J>)e J 

N k=0 



j = N ^ n k 
2 w k=0 K 



(4.27) 



which can be seen to be the sum of two independent parts, one due only to 
the noise and one due only to the signal. The noise portion was handled 
above. Consider now only the portion of the output due to the signal. 
Then, 

C. = 4 I cos (27TfkA + 4.)e“ l2TrJk//N (4.28) 

J’ S N k=0 

A i(2irfkA + <J>) -i(2nfkA + <{>) n -i2 tt jk/N (4.29) 

2N c *- e +e ie 

k=0 
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Now under the assumption that the signal is in the center of the bin we 
have f A = j /N. So 



C. 

3 , s 



_A 

2N 



N-l 



l 

k=0 



A id) 
— e 



e i<j> + e ~i (4iTjk/N + <J>) 



(4.30) 



The total output of the j bin with signal plus noise at the input is 
thus given by 



N-l 

C, = £ e*'*' + ± l n. e 



„ = A ^ 4- I 

j 2 



-i27rjk/N 



N vto “ k 



(4.32) 



A 1 N_1 

= (tt cos <J> + — Y n, cos 2irjk/N. . .A , 1 r> . 0 .. 

2 N k ^ Q k J ) -l^ sm «j> + — £ n k sin 2ivjk/N) 



k=0 



The power spectral component at frequency j /N is given by 

G. = |C.| 2 
J J ' 



(4.33) 



(4.34) 



A 2 N-i • N-l N-l 

G i = T + N cos ^ l n, (2njk/N) + - sin <p l n sin(2Trjk/N) + — £ n 

3 ^ k=0 K W k=0 k N k=0 k 

(4.35) 

and the average power at that frequency is given by 

A 

Avg Pwr = E[Gj] (4.36) 

A 2 a 2 

-r + r (i, ' 37) 

* th 

since EfN^J = 0. Notice that the average power in the j bin is made up 
of two components, one due only to the noise and one due only to the sig- 
nal. This is not an unexpected result since the noise and the signal were 
uncorrelated and therefore, the power should be additive. 
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Referring to equation 4.2, we are now ready to compute PG .. The 

> J 



signal-to-noise ratio of the input signal is given by 

A 2 

SNR = -^2 
2a 



(4 . 38) 



2 2 
where a is the average power (variance) of the noise and A / 2 is the 

RMS signal power. From 4.37 it is obvious that 



snr 2 = -Mr = | snr 2 



4a 

SNR 



therefore PG_ . = 



2 N 



T , j SNR 1 2 



(4.39) 

(4.40) 



which is a realistic result since the original noise power has been ef- 
fectively separated into N/2 bins (see symmetry conditions of the DFT 
in section IIB) and the average sigan] power is totally contained within 
the j bin. 



D. PROCESSING GAIN OF THE INTEGRATOR AND TOTAL SYSTEM GAIN 



. th 



The processing gain due to incoherent integration in the j bin is 
due entirely to the variance reduction (or smoothing) of the noise. This 
may be seen as follows: 



A /4 

SNR = SNR = - 9 - 

o 3 2 /N 



(4.41) 



and 



SNR = - A / - 4 

4 ,X7 

/ N 



(4.42) 



, 2,2 , . . th , . 

where a„ and a. are the variances in the i bin due to the noise at 
j 4 

stages three and four, respectively, of figure 4.2. Referring to Equation 
4. 3 we see that 



PG 



SNR. o. 
4 3 



I.j SNR, 



(4.43) 
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The variance of the smoothed spectral estimate was computed by 
Nuttall (1971) and is given by 



2 (P-1) I , 

Var[G] = —4^' l (1 — p~) I ( i S ) 
P i=-(P-l) P w 



= ^r 1 {^w ( 0 ) l 2 + 2 ^ 1 ( 1 -F ) ^w (ls) 

\ i“l 



(4.44) 

(4.45) 



where G(f) = true spectrum of noise 

<f> (t) = autocorrelation function of the data window 
w 

P = number of raw transforms averaged to obtain 
the estimate 

S = amount of overlap used in the P transforms in 
seconds . 

The reader is referred to figure 2.17 and section III F for a review of 
how the above parameters relate. 

Now, for the raw spectral estimate (i.e., estimate available at stage 
3 of figure 4.2) P = 1 and we have no overlap. Therefore 



0 



2 

3 



- G 2 (£)H u (0)| 



(4.46) 



For the smooth spectral estimate (i.e., estimate at the output of 
the integrator) , P>»1 and we have 



r 2,_,/ 2 P-1 

^'U(0)| + 2 1(1 - |) U w (iS) 



Therefore 



PG t • 

I.J 



1 + 2 l ( 1 - |) 

i=l 



<t> (IS) 2 



w 



V 0) 



(4.47) 

(4.48) 



(4.49) 
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where K is the equivalent number of degrees of freedom of the estimate 
(see section 3). Note that if P = T/L, there is no overlap and PG^ P. 

For P > T/L, the processing gain will be greater than P. Combining 
equation 4.49 and equation 4.40, the system processing gain is given by 




(4.50) 



To maximize the system processing gain for a given T/L, we want to 

overlap the transforms so as to maximize the value of K. Figure 4,3 

shows that K is linear as a function of T/L if the Hanning window- 
max 

is used on the data. Similar results could be obtained for any data 

window using equation 3.35. Figure 4.4 shows the amount of overlap 

required to obtain K , .99K and .95K as a function of T/L. 

max max max 

Note that while the amount of overlap required to obtain K increases 

max 

slightly with T/L, the overlap required to obtain .99K or .95K 

max max 

is nearly independent of T/L. The slight discontinuities in figure 4.4 
at small values of T/L are due to the large percentage overlap changes 
with the addition of one piece (or transform). Thus, for the Hanning 
window it would seem that 60 % overlap would be the optimum overlap 
from a processing gain point of view. Figure 4.5 shows how the value of 
K varies as a function of T/L if other than optimum overlap is used. 

From figure 4.3 we see that 



K 

max 



T 

4.15 (f) 



2.20 



(A. 51) 



So 



PG = N(1.0375 T/L - .55) 



(A. 52) 



or in decibels. 



PG = 10 log 1() N + 10 log 10 (1.0375 T/L - .55) db (A. 53) 
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Figure 4.4. Percent overlap vs T/L for Hanning window 
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Usually one would expect N>>T/L, so that the first term of equation 
will be the most dominant factor in figuring the processing gain. For 
example, if N = 8192 and T/L = 16, 

PG = 39.13 + 12.05 » 51.18 db (4.54) 

E. NON- IDEAL CONDITIONS 

In the above discussion of processing gain, it was assumed that the 
signal frequency remained constant over the entire period of observation. 
If this is not the case, then both PG^ and PG^. will be less than ideal. 
For example, in the case of the transform, assume that the input fre- 
quency is f - j/L for the first N/2 samples and then abruptly jumps to 
f = (j+l)/L for the remaining N/2 samples. Then, the processing gain 
(PG^) for both the j^ and (j+l) St filters is N/8, or a loss of 6 db 
over the ideal case. 

Next assume that the frequency is a constant f = j /L for t seconds 

of the total T seconds of integration, and then abruptly jumps to f 3 

(j+l)/L for the remaining T-t seconds. Then the average power in the 

j bin will be attenuated by a factor of t/T and thus PG^. for the j th 

bin will be attenuated by a factor of t/T (i.e., a loss of 10 log^Ct/T) 

s t 

db) frpm the ideal case. The loss from the ideal case for the (j+1) 
bin will be 10 -log ^ (T-t)/T db. Thus we see that if the integration 
time is significantly larger than the time the signal spends in the bin, 
then we can quickly eliminate any gain we might have obtained through 
the DFT. For example, if T = 256 and t = 80, the loss in the j^ bin 
amounts to 15 db over the ideal case. Careful condideration should 
therefore be given to the integration time and frequency resolution to 
to use for the particular problem at hand. Very fine resolution and a 
long integration time will not aid in detection of a signal which is 
unstable in frequency and possibly fading in and out besides. 



V. SIGNAL DETECTION AND THE HUMAN OPERATOR 



A. INTRODUCTION 

In the mid-1950 , s the theory of statistical decision, motivated by 
problems in radar, was translated into a theory of signal detection. 
Although the theory specified an ideal process, the generality of the 
theory suggested that it might also be relevant to the detection of sig- 
nals by human observers. The relevant theory seemed to provide a frame- 
work for a realistic description of the behavior of the human observer 
in a variety of perceptual tasks. A brief overview of the general the- 
ory and a description of certain elements of the signal detection theory 
appropriate to the human observer are included here to point out the 
flexibility that a human operator can provide in a signal detection 
system. 

Statistical decision theory specifies the optimal behavior in a 
situation where one must choose between two alternative statistical 
hypotheses on the basis of an observed event or series of events. In 
particular, it specifies the optimal cutoff, along the continuum on 
which the observed events are arranged, as a function of (a) the a 
priori probabilities of the two hypotheses, (b) the values and costs 
associated with the various decision outcomes, and (c) the amount of 
overlap of the distributions that constitute the hypotheses. In short, 
the detection problem is a problem in statistical decision; it requires 
testing statistical hypotheses. 

B. FUNDAMENTAL DETECTION PROBLEM 

In the fundamental detection problem, an observation is made of 
events occurring in a fixed interval of time, and a decision is made, 
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based on this observation, whether the interval contained only the back- 
ground interference or a signal as well. The interference, which is 
randora > is referred to as noise and is denoted by N; the other alterna- 
tive is termed signal plus noise and is denoted by SN. In the funda- 
mental problem, only these two alternatives exist — noise is always pre- 
sent, whereas the signal may or may not be present during a specified 
observation interval. Actually, the observer, who has advance knowledge 
of the ensemble of signals to be presented, says either n yes, a signal 
was present" or "no, a signal was not present" following each observa- 
tion. The fundamental problem will later be extended to include a third 
alternative - "I'm not sure, continue", but for now consider only the 
fundamental case. 

C. REPRESENTATION OF SENSORY INFORMATION 

Any observation may arise, with specific probabilities, either from 
noise alone or from signal plus noise. This is shown graphically in 
figure 5.1. The observation is labeled x and plotted on the abscissa. 

The left-hand distribution, labeled f^(x) , represents the probability 
density that x will result given the occurrence of noise alone. The 
right-hand distribution, f^^(x) , is the probability density function of 
x given the occurrence of signal plus noise. Since the observation will 
tend to be of greater magnitude when a signal is presented, the mean of 
the SN distribution will be greater than the mean of the N distribution. 
In general, the greater the amplitude of the signal, the greater will 
be the separation of these means. 

D. OBSERVATION AS A VALUE OF A LIKELIHOOD RATIO 

In the discussion thus far it has been assumed that the observation 
may be represented along a single axis. That this is a valid assump- 
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Probability Density 




Figure 5.1. The probability density function of noise 
and signal plus noise. 
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tion — in spite of the fact that the response of the visual system pro- 
bably has many dimensions - is explained in terms of some concepts that 
are basic to the theory. 

Assume that the response of the sensory system has several dimen- 
sions, and represent it as a point in an m-d imens ional space. Call this 
point y. For each such point in this space there is some probability 
density that it resulted from noise alone, f^(y), and similarly some 
probability density that it was due to signal plus noise, f^^(y) . 
Therefore, there exists a likelihood ratio for each point in the space, 

X (y) - f f^(y) > expressing the likelihood that the point y arose 
from SN relative to the likelihood that it arose from N. Since any 
point in the space, i.e. any sensory datum, may be thus represented as a 
real, non-zero number, these points may be considered to lie along a 
single axis. The observation x is thus identified with X (y) and the 
decision axis becomes the likelihood ratio. 

The assumption of a unidimensional decision axis is independent of 
the characteristics of the signal and noise. Rather, it depends on the 
fact that just two decision alternatives are considered. More generally, 
it can be shown that the number of dimensions required to represent the 
observation is M-l, where M is the number of decision alternatives con- 
sidered by the observer. 

Having established that the observation x may be identified with the 
likelihood ratio X (y) , it can also be shown -that we may equally well 
identify x with any monotonic transformation of X (y) ; i.e. nothing is 
lost by distorting the linear continuum as long as order is maintained. 

In particular, if it is assumed that a transformation of X (y) results in 
Gaussian density functions on x, the problem will be greatly simplified 
as will be seen shortly. The existence of such a transformation was 
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assumed in the representation of the density functions, f^(x) and 
f^(x) in figure 5.1. A further assumption is that the SN function is 
simply a translation of the N function; i.e., that the two density 
functions have equal variances. This has been found to be not strictly 
true, but for the purpose of this discussion the small difference in the 
variances will not be relevant. 

To summarize the last few paragraphs, it has been shown that an 
observation may be characterized by a value of a likelihood ratio, 

X (y) , i.e., the likelihood that the response of the sensory system y 
arose from SN relative to the likelihood that it arose from N. This 
permits viewing the observations as lying along a single axis. It was 
then assumed that a particular transformation of X (y) would result in 
normal probability density functions on x. The observer is assumed to 
base his decisions on the variable x. 

E. DEFINITION OF THE CRITERION 

From the definition of the problem we see that the observer attemp- 
ting to detect signals in noise is faced with two alternate hypotheses. 
He must decide from which hypothesis the observation resulted, N or SN. 
The observer must establish a policy that defines the circumstances 
under .which the observation will be regarded as resulting from each of 
the two possible events. He will establish a criterion, a cutoff x^ on 
the continuum of observations, to which he can relate any given observa- 
tion x.. If he finds for the i ^ observation, x. , that x. > x , he says 
l 5 l c* J 

11 SN 1 1 ; of < x^, he says "N". 

The observer's decision will be one of four outcomes: he will say 

"SN" or ,, N" and will in either case be right or wrong. The decision 
outcome, in other words, will be a "hit", a "miss", a "correct rejec- 
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tion”, or a ’’false alarm”. Clearly, these four possibilities are inter- 
dependent. For example, an increase in the probability of a hit can be 
achieved only by accepting an increase in the probability of a false 
alarm and decrease in the other probabilities. Thus, the balance de- 
sired by an observer in any instance will determine the optimal location 
of his criterion, . The relationship of these four probabilities to 
one another is shown in figure 5.2, where the probability of a ’’hit” is 
represented by region f^ • A, the probability of a ’’miss” by f • B 
(or (3), the probability of a ’’correct rejection” by f^ • B, and the pro- 
bability of a ’’false alarm” by f^ • A (or O'), 

There are two conditional probabilities of principal interest. These 
are the conditional probability of the observer saying ”SN” given the 
occurrence of a signal, P^(A), and the probability of saying ”SN” given 
the occurrence of noise alone, P^(A). These two are sufficient since the 
other two are merely their complements: P^(B) = 1 - P^(A) and P^(B) = 

1 - P^(A). The conditional and joint probabilities are related as 
follows : 



p sn ( a > 



p (SN • A) 
P(SN) 



(5.1) 



V A) 



p(N » A) 
P(N) 



(5.2) 



where p(SN) is the a priori probability of signal occurrence and p(N) = 

1 - p (SN) is the a priori probability of occurrence of noise alone. 

With reference to figure 5.2, note that P^(A) is the integral of f^^(x) 
over the critical region A and P^(A) is the integral of f^(x) over A. 

In statistical decision theory, and in the theory of signal detecta- 
bility, the optimal criterion for a decision is specified in terms of the 
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Figure 5.2. The relationship between the four possible 
outcomes of a simple detection - decision. 
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likelihood ratio. That is to say, it can be shown that the optimal cri- 
terion can always be specified by some value 0 of X (x) . In other words, 
the critical region A that corresponds to the criterion contains all 
observations with likelihood ratio greater than or equal to 0, and none 
of those with likelihood ratio less than 9. To see that this is true, 
note that the expected value of a decision (denoted EV) is defined in 
in statistical decision theory as the sum, over the potential outcomes 
of a decision, of the products of the probability of outcome and desira- 
bility of outcome. Thus, using the notation V for positive individual 
values of the observer and K for costs on negative individual values, 
we have the following equation: 



EV = V SN-A P(SN * A) + V N-B P(N ‘ B) " K SN-B P(SN ‘ B) • h-A P(N ‘ A) (5<3) 



If a priori and conditional probabilities from equations 5.1 and 5.2 
are substituted for the joint probabilites in equation 5.3, then collec- 
ting terms yields the results that maximizing EV is equivalent to maxi- 
mizing 



where 



P SN^ 6 P n( A ) 

e = _l£nL_ . ( v n-b + Va) , 
■ p(SN) \SN* A + K SN.b) 



(5.4) 

(5.5) 



It can be shown that this value of 0 is equal to the value of the like- 
lihood ratio that corresponds to the optimal criterion. From equation 
5.4 it is seen that the value 0 simply weights the hits and false alarms 
and equation 5.5 shows that 0 is determined by the a priori probabili- 
ties of occurrence of signal and of noise alone and by the values asso- 
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ciated with the individual decision outcomes. Referring to figure 
5.2 and keeping in mind the result that 0 is a critical value of X (x) , 



it should be clear that the optimal cutoff x^ along the x axis is at the 



The value of 0 above specifies the optimal weighting of hits rela- 



axis corresponding to 0. Just where this value of x^ will be with re- 
ference to the x-axis depends not only upon the overlap of the two den- 
sity functions (i.e., upon the signal strength). 

F. RECEIVER OPERATING CHARACTERISTICS 

Whatever criterion the observer actually uses, even if it is not an 
optimal criterion, it can be described by a single number — some value 
of likelihood ratio. By observing how the observer f s performance is 
evaluated with respect to the location of his criterion, it can be seen 
how his performance may be evaluated with respect to his sensory capa- 
bilities . 

As previously noted, the fundamental quantities in the evaluation of 
the operators performance are p^(A) and p s ^(A). Figure 5.3 shows a 
graph of Pg^j(A) versus p^(A), each arc representing a different amount 
of overlap. Each curve in figure 5.3 corresponds to a given separation 
between the means of the density functions f^(x) an< 3 ^ le P ara ” 

meter of these curves is labeled d ! , where d 1 is defined as the differ- 
ence between the means of the two density functions expressed in terms 
of their standard deviation, that is: 



point on the axis where the ratio of the ordinate value of f^^(x) to the 
ordinate value of f^(x) i s a certain number, namely 0. 



tive to false alarms; x should always be located at the point on the x- 




SN 



(x) - y f (x) 



N 



(5.6) 



a 
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If the decision criterion is set at the left of figure 5.2, we will 
obtain a point in the upper right-hand corner of figure 5.3; both Pg^(A) 
and p^(A) are unity. If the criterion is set at the right end of the 
decision axis in figure 5.2, a point at the other extreme of figure 5.3 
results; both p^(A) and p^(A) are equal to zero. Between these two 
extremes lies the criterion of practical interest. The form of the 
curves in figure 5.3 is not the only one which might result, but it is 
the form which will result if the observer chooses a criterion in terms 
of likelihood ratio, and the probability density functions are normal 
and of equal variance. 

In the context of the detection problem, figure 5.3 is usually re- 
ferred to as the receiver-operating-characteristic, or ROC curve. The 
optimal "operating level 11 is the point on a curve where its slope is 
equal to 0 (see equation 5.5). That is, the expression p^(A) - 0p^(A) 
defines a utility line of slope , and the point of tangency of this 
line to the ROC curve is the optimal operating level. Thus the theory 
specifies the appropriate "hit" probability and "false alarm" probability 
for any definition of optimum and any set of parameters characterizing 
the detection situation. 

Since the separation between the means of the two density functions 
is a function of signal amplitude, d’ is an index of the detectability 
of a given signal for a given observer. The variable d* is a measure 
of the observer’s sensory capabilities or the effective signal strength. 
The criterion that is employed by the observer, which determines the 
p (A) and p (A) for some fixed d 1 , reflects the effect of variables 

IN O 

which have been variously called the set, attitude, or motives of the 
observer . 
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Figure 5,3. Typical ROC curves 
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G . SEQUENTIAL OBSERVATIONS 



If additional observations are allowed before a decision is required, 
then the observer is assumed to adopt two criteria, or three critical 
regions. These regions, at least those pertaining to the first observa- 
tion, are represented as A, B, and C in figure 5.4, If the first obser- 
vation falls in A the observer says "SN", if it falls in B he says "N", 
and if it falls in C he says "Continue". The theory of sequential analy- 
sis assumes that the successive observations are integrated and that the 
integrated value is compared with the three critical regions at each 
observation stage. Thus, a decision is made whenever the combined evidence 
for one or the other of the two hypotheses is sufficiently persuasive. 

It is clear that the relationship depicted in figure 5.4 holds only for 
the first observation in a sequence. A second observation is required 
only if the first falls in the intermediate region, C. The combination 
of the first and second observations, therefore, will not be normally 
distributed , for it results from the addition of a normal variable and 
a normal variable with both tails missing. Analysis of the inter-rela- 
tionships of the parameters is extremely difficult in the sequential 
observation problem because of this. 

H. USE OF IDEAL DESCRIPTIONS AS MODELS 

At first glance it might appear that a theory of ideal behavior is 
really not very useful when talking about a human observer. This is not 
the case, however; ideal behavior constitutes a convenient base from 
which to explore the complex behavior of a real organism. Ideal con- 
ditions generally involve few variables. Having identified the per- 
formance to be expected under ideal conditions, it is possible to extend 
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Figure 5.4. Representation of the sequential decision 
problem. 
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the model to include additional variables associated with the real 
world. In studies using human observers in the signal detection pro- 
blem, it has been found that the ideal theory closely approximates 
human behavior. The problem is one of altering the ideal model in 
some way so that it is slightly less than ideal. 

The human observer is able to adjust his detection criterion to a 
particular problem if the information is presented to him in the correct 
manner. That is, the learning capabilities of the human observer can 
provide a great deal of flexibility to a signal detection system if he 
is used correctly. The value of this flexibility will be beneficial 
however, only if the values of the observer are such that they complement 
the system. The attitude and motives of the observer play a very impor- 
tant role in the effectiveness of the system. Recent advances in digital 
processing techniques and computer graphics show great promise in con- 
trolling these parameters through increased processor gain and a man- 
machine interface never before possible with analog equipment. 
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VI. CONCLUSIONS AND RECOMMENDATIONS 



This paper has discussed digital signal processing and spectral an- 
alysis techniques as applied to the detection of discrete narrowband sign 
als in a nonstationary noise background. Specifically, the discussion 
was slanted toward application of a Fourier processor with integration 
to the Navy*s ASW effort. There is a real need today for improved proces 
sing techniques and it is apparent that digital processors are the most 
likely means to improvement. The fast Fourier transform algorithm has 
led to extensive revision in the methods of frequency domain analysis 
and real-time digital processors to the point that bandwidths that are 
sufficient for most problems are a reality today. 

It appears that the first attempt at using interactive graphics tec- 
hniques in the display of a passive ASW signal processor is being made in 
the "Spotlight 11 processor. Preliminary use of the display indicates that 
the operator now has a flexibility and an insight into the detection 
problem never before possible with previous displays. This particular 
combination of processing and display has tremendous possibilities in the 
field of ASW today. 

There are two questions fundamental to the ASW detection problem, 
which the author feels are of primary importance in determining the per- 
formance capabilities of a Fourier signal processor such as the "Spot- 
light" processor. The first is the question of the effects on the 
spectral estimates of integrating for relatively long periods of time 
over information which is known to be from a non-s tationary background. 
The second is the human factors question of what is the optimal amount 
of redundancy required in the display for a processor such as this. 
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Answers, or at least partial answers, to these questions would aid 
tremendously in the actual implementation of this type of equipment. 
Research in these areas is highly recommended since it is only after 
these questions are answered that the full potential of such a system 
can be evaluated. 
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these sections, and averaging these "raw" estimates. It is shown that 
more stable estimates are available if the segments are overlapped and 
an optimum amount of overlap for the case of the Hanning window is 
found. It is shown that the stability of these spectral estimates can 
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signal in additive noise. And finally, a brief description of signal 
detection theory applied to a human observer is presented to emphasize 
the flexibility that a human operator can bring to a signal detection 
system. 
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